Feature Summary and Visualization Report


category_high_card

Metric Value
feature_name category_high_card
data_type character
n_unique_values 1500
is_primary_key TRUE
pct_logical_null 0
pct_char_null_na 0
pct_empty_string 0
top_n_categories ID_1, ID_2, ID_3, ID_4, ID_5, ID_6, ID_7, ID_8, ID_9, ID_10, ID_11, ID_12, ID_13, ID_14, ID_15
top_n_pct_value 0.07
top_n_pct 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07

Data Frequencies (from Binned Data)

Value Count Percentage
others 1425 95.00%
ID_1 1 0.07%
ID_2 1 0.07%
ID_3 1 0.07%
ID_4 1 0.07%
ID_5 1 0.07%
ID_6 1 0.07%
ID_7 1 0.07%
ID_8 1 0.07%
ID_9 1 0.07%
ID_10 1 0.07%
ID_11 1 0.07%
ID_12 1 0.07%
ID_13 1 0.07%
ID_14 1 0.07%
others 61 4.07%

category_moderate

Metric Value
feature_name category_moderate
data_type character
n_unique_values 10
is_primary_key FALSE
pct_logical_null 0
pct_char_null_na 0
pct_empty_string 0
top_n_categories A, G, H, B, E, I, C, J, D, F
top_n_pct_value 11.47
top_n_pct 11.47, 10.87, 10.73, 10.6, 10, 9.6, 9.53, 9.4, 9.2, 8.6

Data Frequencies (from Binned Data)

Value Count Percentage
A 172 11.47%
G 163 10.87%
H 161 10.73%
B 159 10.60%
E 150 10.00%
I 144 9.60%
C 143 9.53%
J 141 9.40%
D 138 9.20%
F 129 8.60%

character_comments

Metric Value
feature_name character_comments
data_type character
n_unique_values 3
is_primary_key FALSE
pct_logical_null 18.73
pct_char_null_na 0
pct_empty_string 19.93
top_n_categories NULL, Good, Bad, Okay
top_n_pct_value 38.67
top_n_pct 38.67, 21.8, 19.93, 19.6

Data Frequencies (from Binned Data)

Value Count Percentage
Good 327 35.54%
Bad 299 32.50%
Okay 294 31.96%

date_col

Metric Value
feature_name date_col
data_type date
n_unique_values 1500
is_primary_key TRUE
pct_logical_null 0
pct_char_null_na 0
pct_empty_string 0
top_n_categories 2023-01-01, 2023-01-02, 2023-01-03, 2023-01-04, 2023-01-05, 2023-01-06, 2023-01-07, 2023-01-08, 2023-01-09, 2023-01-10, 2023-01-11, 2023-01-12, 2023-01-13, 2023-01-14, 2023-01-15
top_n_pct_value 0.07
top_n_pct 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07

Data Frequencies (from Binned Data)

Bin Count Percentage
[2023-07-01 to 2023-09-30] 92 6.13%
[2023-10-01 to 2023-12-31] 92 6.13%
[2024-07-01 to 2024-09-30] 92 6.13%
[2024-10-01 to 2024-12-31] 92 6.13%
[2025-07-01 to 2025-09-30] 92 6.13%
[2025-10-01 to 2025-12-31] 92 6.13%
[2026-07-01 to 2026-09-30] 92 6.13%
[2026-10-01 to 2026-12-31] 92 6.13%
[2023-04-01 to 2023-06-30] 91 6.07%
[2024-01-01 to 2024-03-31] 91 6.07%
[2024-04-01 to 2024-06-30] 91 6.07%
[2025-04-01 to 2025-06-30] 91 6.07%
[2026-04-01 to 2026-06-30] 91 6.07%
[2023-01-01 to 2023-03-31] 90 6.00%
[2025-01-01 to 2025-03-31] 90 6.00%
others 129 8.60%

datetime_col

Metric Value
feature_name datetime_col
data_type POSIXct
n_unique_values 1500
is_primary_key TRUE
pct_logical_null 0
pct_char_null_na 0
pct_empty_string 0
top_n_categories 2023-01-01 10:00:00, 2023-01-01 10:10:00, 2023-01-01 10:20:00, 2023-01-01 10:30:00, 2023-01-01 10:40:00, 2023-01-01 10:50:00, 2023-01-01 11:00:00, 2023-01-01 11:10:00, 2023-01-01 11:20:00, 2023-01-01 11:30:00, 2023-01-01 11:40:00, 2023-01-01 11:50:00, 2023-01-01 12:00:00, 2023-01-01 12:10:00, 2023-01-01 12:20:00
top_n_pct_value 0.07
top_n_pct 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07

Data Frequencies (from Binned Data)

Bin Count Percentage
2023-01-02 144 9.60%
2023-01-03 144 9.60%
2023-01-04 144 9.60%
2023-01-05 144 9.60%
2023-01-06 144 9.60%
2023-01-07 144 9.60%
2023-01-08 144 9.60%
2023-01-09 144 9.60%
2023-01-10 144 9.60%
2023-01-11 120 8.00%
2023-01-01 84 5.60%

days_since_datetime_col

Metric Value
feature_name days_since_datetime_col
data_type integer
n_unique_values 11
is_primary_key FALSE
pct_logical_null 0
pct_char_null_na 0
pct_empty_string 0
top_n_categories 1498, 1497, 1496, 1495, 1494, 1493, 1492, 1491, 1490, 1489, 1499
top_n_pct_value 9.6
top_n_pct 9.6, 9.6, 9.6, 9.6, 9.6, 9.6, 9.6, 9.6, 9.6, 8, 5.6

Original Descriptive Statistics (Min/Max/Quartiles)

Statistic Value
Min. 1489.00
1st Qu. 1491.00
Median 1494.00
Mean 1493.88
3rd Qu. 1496.00
Max. 1499.00

integer_rating

Metric Value
feature_name integer_rating
data_type integer
n_unique_values 5
is_primary_key FALSE
pct_logical_null 1.33
pct_char_null_na 0
pct_empty_string 0
top_n_categories 3, 4, 5, 2, 1, NULL
top_n_pct_value 20.8
top_n_pct 20.8, 19.87, 19.67, 19.4, 18.93, 1.33

Original Descriptive Statistics (Min/Max/Quartiles)

Statistic Value
Min. 1.00
1st Qu. 2.00
Median 3.00
Mean 3.02
3rd Qu. 4.00
Max. 5.00

logical_flag

Metric Value
feature_name logical_flag
data_type logical
n_unique_values 2
is_primary_key FALSE
pct_logical_null 0
pct_char_null_na 0
pct_empty_string 0
top_n_categories TRUE, FALSE
top_n_pct_value 70.93
top_n_pct 70.93, 29.07

Data Frequencies (from Binned Data)

Value Count Percentage
TRUE 1064 70.93%
FALSE 436 29.07%

mixed_nulls_text

Metric Value
feature_name mixed_nulls_text
data_type character-logical
n_unique_values 2
is_primary_key FALSE
pct_logical_null 13
pct_char_null_na 37.67
pct_empty_string 24.33
top_n_categories NULL, Valid, Value
top_n_pct_value 75
top_n_pct 75, 12.73, 12.27

Data Frequencies (from Binned Data)

Value Count Percentage
Valid 191 50.93%
Value 184 49.07%

numeric_target

Metric Value
feature_name numeric_target
data_type numeric
n_unique_values 1500
is_primary_key TRUE
pct_logical_null 0
pct_char_null_na 0
pct_empty_string 0
top_n_categories 358.819768112153, 809.474621899426, 468.07922963053, 894.715663604438, 946.420555864461, 141.000849450938, 575.294939242303, 903.177139954641, 596.291513019241, 510.953261773102, 961.15001081489, 508.000740571879, 709.81357190758, 615.370061760768, 192.632214399055
top_n_pct_value 0.07
top_n_pct 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07

Original Descriptive Statistics (Min/Max/Quartiles)

Statistic Value
Min. 100.42
1st Qu. 324.38
Median 542.38
Mean 546.53
3rd Qu. 769.12
Max. 999.57

numeric_with_na

Metric Value
feature_name numeric_with_na
data_type numeric
n_unique_values 1450
is_primary_key FALSE
pct_logical_null 3.33
pct_char_null_na 0
pct_empty_string 0
top_n_categories NULL, 11.2256003776565, 91.7815666180104, 87.1537754312158, 64.1269314335659, 67.2126817749813, 83.7917572120205, 24.7047446900979, 97.752767498605, 87.9221294308081, 38.6420763097703, 33.675015042536, 51.256316783838, 54.5025422703475, 93.4922681190073
top_n_pct_value 3.33
top_n_pct 3.33, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07

Original Descriptive Statistics (Min/Max/Quartiles)

Statistic Value
Min. 0.06
1st Qu. 24.91
Median 50.44
Mean 50.47
3rd Qu. 76.27
Max. 99.99

primary_key

Metric Value
feature_name primary_key
data_type integer
n_unique_values 1500
is_primary_key TRUE
pct_logical_null 0
pct_char_null_na 0
pct_empty_string 0
top_n_categories 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15
top_n_pct_value 0.07
top_n_pct 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07, 0.07

Original Descriptive Statistics (Min/Max/Quartiles)

Statistic Value
Min. 1.00
1st Qu. 375.75
Median 750.50
Mean 750.50
3rd Qu. 1125.25
Max. 1500.00